Picture for Wayne Xin Zhao

Wayne Xin Zhao

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Add code
Feb 03, 2026
Viaarxiv icon

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Add code
Feb 03, 2026
Viaarxiv icon

ForesightKV: Optimizing KV Cache Eviction for Reasoning Models by Learning Long-Term Contribution

Add code
Feb 03, 2026
Viaarxiv icon

Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning

Add code
Jan 31, 2026
Viaarxiv icon

RecNet: Self-Evolving Preference Propagation for Agentic Recommender Systems

Add code
Jan 29, 2026
Viaarxiv icon

GenCI: Generative Modeling of User Interest Shift via Cohort-based Intent Learning for CTR Prediction

Add code
Jan 26, 2026
Viaarxiv icon

MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging

Add code
Jan 25, 2026
Viaarxiv icon

LLM-in-Sandbox Elicits General Agentic Intelligence

Add code
Jan 22, 2026
Viaarxiv icon

Controlled LLM Training on Spectral Sphere

Add code
Jan 13, 2026
Viaarxiv icon

VIPER: Process-aware Evaluation for Generative Video Reasoning

Add code
Dec 31, 2025
Viaarxiv icon